Neural Network Architecture Optimization through Submodularity and Supermodularity

نویسندگان

  • Junqi Jin
  • Ziang Yan
  • Kun Fu
  • Nan Jiang
  • Changshui Zhang
چکیده

Deep learning models’ architectures, including depth and width, are key factors influencing models’ performance, such as test accuracy and computation time. This paper solves two problems: given computation time budget, choose an architecture to maximize accuracy, and given accuracy requirement, choose an architecture to minimize computation time. We convert this architecture optimization into a subset selection problem. With accuracy’s submodularity and computation time’s supermodularity, we propose efficient greedy optimization algorithms. The experiments demonstrate our algorithm’s ability to find more accurate models or faster models. By analyzing architecture evolution with growing time budget, we discuss relationships among accuracy, time and architecture, and give suggestions on neural network architecture design.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Optimization of ICDs' Port Sizes in Smart Wells Using Particle Swarm Optimization (PSO) Algorithm through Neural Network Modeling

Oil production optimization is one of the main targets of reservoir management. Smart well technology gives the ability of real time oil production optimization. Although this technology has many advantages; optimum adjustment or sizing of corresponding valves is still an issue to be solved. In this research, optimum port sizing of inflow control devices (ICDs) which are passive control valves ...

متن کامل

Optimization of Oleuropein Extraction from Olive Leaves using Artificial Neural Network

In this work, the artificial neural networks (ANN) technology was applied to the simulation of oleuropein extraction process. For this technology, a 3-layer network structure is applied, and the operation factors such as  amount  of  flow  intensity  ratio,  temperature,  residence  time,  and  pH  are  used  as  input  variables  of  the network,  whereas  the  extraction  yield  is  considere...

متن کامل

On Approximate Non-submodular Minimization via Tree-Structured Supermodularity

We address the problem of minimizing nonsubmodular functions where the supermodularity is restricted to tree-structured pairwise terms. We are motivated by several real world applications, which require submodularity along with structured supermodularity, and this forms a rich class of expressive models, where the non-submodularity is restricted to a tree. While this problem is NP hard (as we s...

متن کامل

A New Architecture Based on Artificial Neural Network and PSO Algorithm for Estimating Software Development Effort

Software project management has always faced challenges that have often had a great impact on the outcome of projects in future. For this, Managers of software projects always seek solutions against challenges. The implementation of unguaranteed approaches or mere personal experiences by managers does not necessarily suffice for solving the problems. Therefore, the management area of software p...

متن کامل

Neuro-Optimizer: A New Artificial Intelligent Optimization Tool and Its Application for Robot Optimal Controller Design

The main objective of this paper is to introduce a new intelligent optimization technique that uses a predictioncorrectionstrategy supported by a recurrent neural network for finding a near optimal solution of a givenobjective function. Recently there have been attempts for using artificial neural networks (ANNs) in optimizationproblems and some types of ANNs such as Hopfield network and Boltzm...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1609.00074  شماره 

صفحات  -

تاریخ انتشار 2016